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The Main Diagonal of a Permutation Matrix 

Marko Lindner and Gilbert Strang 



Abstract 

By counting l's in the "right half" of 2w consecutive rows, we locate the main diagonal of any doubly infinite 
permutation matrix with bandwidth w. Then the matrix can be correctly centered and factored into block- 
diagonal permutation matrices. 



Part II of the paper discusses the same questions for the much larger class of band-dominated matrices. The 
main diagonal is determined by the Fredholm index of a singly infinite submatrix. Thus the main diagonal is 
■ determined "at infinity" in general, but from only 2w rows for banded permutations. 
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This paper is about banded doubly infinite permutation matrices. These matrices represent 
permutations of the integers Z, in which no integer moves more than w places. The banded 
matrix P has zero entries Pij whenever \i — j\ > w, and it has exactly one entry = 1 in each 
row and column. So P has 2w + 1 diagonals that are possibly nonzero, and exactly one of these 
deserves to be called the main diagonal. The first objective of this paper is to find the main 
^ ■ diagonal of P. 

The remarkable fact is that the correct position of the main diagonal is determined by any 2w 
consecutive rows of P. This will be Theorem 1. Normally that position can only be determined 
"at infinity" . 

For a much larger class of biinfinite matrices A, we form the singly infinite submatrix A + 
from rows i > and columns j > of A. The plus-index of A is the usual Fredholm index of 
A + , computed from the dimension of its nullspace and the codimension of its range: 

index+(A) = index(A+) = dim N(A + ) - codim R(A + ). (1) 

For any Fredholm operator, including all permutations A = P, those two numbers are finite. 
Then the main diagonal of A is determined by k = index + (v4). It is found k diagonals above 
(or —k diagonals below) the zeroth diagonal of A. So our problem is to compute that index. 

Example 1.1 The doubly infinite forward shift S has nonzero entries S^-i = 1 for i G Z. Its 
singly infinite submatrix S + is lower triangular, with those ones along the first subdiagonal. The 
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nullspace of S+ has dimension = (independent columns of S+) but the range has codimension 
= 1 (it consists of all singly infinite vectors with a zero in the first position). Thus k = 
index + (6') = — 1 and the main diagonal of S is correctly located: It is one diagonal below the 
zeroth diagonal of S (and it contains the ones). 

Since S is a permutation of Z with bandwidth w = 1, Theorem 1 says that index + (S') can be 
found from any two consecutive rows i and i+1 of S: "Count the ones in columns i + 2, . . . 
and subtract w. n The result — 1 agrees with index + (S'). □ 

We can directly state Theorems 1-3, for banded permutations P of Z. Two proofs are given 
for Theorem 1. Then Part II of the paper discusses the much larger class of doubly infinite 
band-dominated Fredholm operators A. The plus-index of A (and thus its main diagonal) is 
well defined but not so easily computable. 



Theorem 1 If P has bandwidth w, its plus-index is determined by rows 1, . . . , 2w and columns 
j > w. Subtract w from the number n of ones in this submatrix. The result k = n — w is the 
plus-index of P, and the main diagonal of P is k diagonals above the zeroth diagonal. 

The same result k comes from rows j* —w to j* + w — 1 and columns > j* , for any j* G Z. 
The submatrix is shown in (13) below. 



Equivalently, P c = S K P is a centered permutation of Z, by which we mean that its zeroth 
diagonal is the main diagonal. Our first proof will show how the submatrix with 2w rows was 
discovered. The second proof goes directly to the index k. 



Theorem 2 Every centered permutation P c of Z with bandwidth w can be factored into a 
product of two block- diagonal permutations: 
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All blocks have size 2w and each Ci is "offset" between Bi and B i+ i (shifted by w rows and 
columns, as shown). 



Theorem 3 The centered permutation P c can be further factored into N < 2w block-diagonal 
permutations of bandwidth 1 : 

P c = F 1 F 2 ---F N (3) 



Each factor Fj has block size 1 or 2. Thus Fj exchanges disjoint pairs of neighboring rows. 
By Theorems 1-3 the original P is factored into 



S~ K P C = S- K BC = 5-«FiF 2 



N- 
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2 The index and the plus-index 



We say that an infinite matrix A = (ajj)ijez is invertible if the linear operator that it represents 
via matrix- vector multiplication is invertible as an operator from £ 2 (Z) to £ 2 (7*). A bounded 
linear operator A from £ 2 into £ 2 is invertible iff it is both injective (its nullspace N(A) = {16 
£ 2 : Ax = 0} consists of only) and surjective (its range R(A) = {Ax : x G £ 2 } is all of £ 2 ). 
Deviation from both properties is measured in terms of the two integers 

a = dim N(A) and /3 = codim R(A). (4) 

A is a Fredholm operator if both a and {3 are finite. Then the Fredholm index (or just the 
index) of A is the difference 

index(A) = a - 0. (5) 
Unlike the separate numbers a and j3, their difference a — (5 has these important properties: 

• the index is invariant under compact perturbations A + K; 

• the index of a product obeys the remarkable formula index (AB) = index(A) + index(5); 

• the index is continuous with respect to the operator norm of A (and therefore locally 
constant); and 

• all finite square matrices have index zero. 

All Fredholm matrices of index zero, including all finite square matrices, obey the Fredholm 
alternative: Either A is injective and surjective (a = 0, (3 = 0) or A is not injective and not 
surjective (a ^ 0, /3 ^ 0). This is the set of all operators A — C + K with C invertible and 
K compact. See e.g. [7, 10] for a nice introduction to Fredholm operators. The property of 
being invertible on ^ P (Z) and the index itself are independent of p G [1, oo] for a certain class 
of infinite matrices (with uniformly bounded entries and summable off-diagonal decay), see 
[12, 14]. 

Our interest in the index (more precisely, the plus-index) of a biinfinite matrix originates 
from the following natural question: 

Which diagonal is the main diagonal of a biinhnite matrix? 

For a symmetric matrix, the zeroth diagonal is the main diagonal. For a Toeplitz matrix (a 
polynomial in the shift S), a zero winding number is the key. A wider class of structured 
matrices was analyzed by de Boor [8]. For Fredholm operators in general, Israel Gohberg's 
diplomatic answer to this question was that every diagonal has the right to be the main diagonal 
[3, p. 24]. But there are concrete problems waiting for a concrete answer. Here are two such 
problems: 

1. For finite and semiinfinite matrices, the inverse of a lower triangular matrix is again lower 
triangular. This may fail if A is biinfinite (essentially because the 'wrong' diagonal is 
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mistaken for the main diagonal). The inverse of the lower triangular shift S is the upper 
triangular backward shift S T . Shifting S one row up (treating the ones as the main 
diagonal) resolves this conflict. 

Here is a slightly more sophisticated example: A = S — |S* 2 is lower triangular with its 
inverse neither lower nor upper triangular. Shift up by one row: B = S _1 A = I — |S* 
is lower triangular with B~ x = I + + \S 2 + • • • also lower triangular (as we want). 
Shift up one more row: C = S~ 2 A = S^ 1 — |7 is now upper triangular with C^ 1 = B _1 S 
lower triangular. In this example the diagonal consisting of all ones should be the main 
diagonal, and B is centered (even though triangular) so that its inverse is triangular of 
the same kind. 

2. The numerical solution of biinfmite systems Ax = b can approximate A -1 by the inverses 
of finite square submatrices (finite sections) of A. Their upper left and lower right corners 
lie on the main diagonal of A. But again, which diagonal should that be? For A = 
2S 2 + — I + 2S" -1 , one can show [4, 5] that it has to be the diagonal that carries all 
the i's. Finite sections that are centered along one of the other nonzero diagonals are 
invertible (for sufficiently large sizes) but their inverses do not converge (they blow up). 



Problem 1 and 2 are not unrelated: If A is lower triangular and the inverses of its finite sections 
(which are also lower triangular) approximate A~ l , then A~ l will be lower triangular, too. 
A similar argument can be used to transfer Asplund's theorem [2, 25] to infinite matrices, 
explaining the relations between ranks of submatrices of A and A~ l . We demonstrate this in 
Part II of this paper. 

The answer to both problems is the same: Shift A down by k rows, or equivalently by k 
diagonals, where k is the plus-index of A [17]: 



k = index + (A) = index(A_ 



and 



.4, 



/ an a 12 

^21 ^22 



\ 



(6) 



A + is a semiinfinite submatrix of A. This shifting process is called index cancellation [9, 11]. 

The centered matrix A c = S K A has index + (yl c ) = (see (16) below). This is necessary for 
a triangular matrix to have a triangular inverse of the same kind (Corollary 6.4). It is also 
necessary for the convergence of the inverses of the finite sections to A" 1 (see [15, 20]). 

How is this plus- index k computed? In [17] it is shown that k is invariant under passing 
to a "limit operator" [18, 13, 6] of A at +oo, which often simplifies the computation. For the 
plus-index of a permutation P, the limit operator is not needed and the new formula (12) is as 
simple as possible. 



4 



Part I 

Permutation matrices 



3 The plus-index of a biinfinite permutation matrix 

Now we come to the problem of computing the plus-index of a biinfinite permutation matrix. 
So let 7r : Z — > Z be a permutation (a bijection) of the integers and put P = (Pij)ijez with 



Pij 
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1 if j = tt(2), 
i£j^ir(i). 



The matrix P is banded iff the maximal displacement w of any integer via n is finite: 

w := sup \i — 7r(z)| is the bandwidth of P. (7) 



Every permutation matrix P is invertible. So the submatrix P + is always Fredholm (see 
Lemma 5.2 below) and it makes sense to ask for its index: the plus-index of P. 

It seems a bit arbitrary to define the plus-index of a biinfinite matrix A based on the 
submatrix A + that starts at the particular entry an. Lemma 3.1 shows that for banded 
permutations, the submatrix A/, starting at a^k gives the same plus-index for every k G Z: 



-4, 



[ a ij)iJ=k 



' 0>k,k CLk,k+l 

a>k+i,k Q>k+i,k+i 



\ 



(8) 



Lemma 3.1 If P is a banded biinfinite permutation matrix and k G Z then index + (P) = 
index(Pfc) holds independently of k. 



Proof. Let fceN first. Because the first k rows and columns of Pq contain only finitely many 
nonzero entries, we have the following equality modulo finite rank operators: 




Consequently, index(P ) = index^^). For k G Z \ N the argument is very similar. ■ 

We will write P + for any of these singly infinite submatrices J\. Notice that P + can be 
the zero matrix (not Fredholm) when P is not banded. An example is the permutation that 
exchanges every pair i and —i, for i G Z. 

Here is a concrete formula for the plus-index of permutation matrices that "split" : 
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Theorem 3.2 Let n be a permutation of the integers (not necessarily banded) and denote the 
corresponding matrix by P. Suppose there exist i* and j* such that 



Then index + (P) = j* 



{%({) : i<i*} = {j : 

* 


j < j*}- 




* and j* exist then P decouples into two blocks: 
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(10) 



11^ 



The meeting point may not fall on the zeroth diagonal of P (in fact, it falls on the main 

diagonal). Because P is invertible, P^ and P^> are invertible. It is now easy to show that 
j* — i* is the plus-index of P: 

Case 1. If i* < j* then, putting k := i*, we have that starts with j* — i* zero columns. 
Those are followed by P^ 2 \ so that the index of Pk is (j* — £*) — = j* — i*. 

Case 2. If i* > j* then, putting k := j*, we have that Pk has i* — j* zero rows, followed by 
P( 2 \ Again the index is — (i* — j*) = j* — i*. 

It remains to apply Lemma 3.1. ■ 

Adding k = j* — i* to all row numbers moves the meeting point of the blocks in (11) to 
position (J*,j*). In fact index + (A) = j* — i* holds for all invertible matrices A (not just 
permutations) that decouple in the sense of (11). 

The first question is whether or not such a splitting will appear in every permutation of Z 
- and the unfortunate answer is no! 

Example 3.3 (symmetric and intertwined) Let us depict this permutation by a graph 
with vertex set Z and with a directed edge (an arrow) from i to j iff j = 7r(«). 



Z 



1 f 



. . . . . . . . . . 



This permutation is symmetric (vr(i) = j iff ir(j) = i). In such a case, splitting happens iff the 
graph falls into separate components {i < i*} and {i > i*}. Then the graph can be split in two 
without cutting any edge, which is obviously impossible in our example. 

Of course, this symmetric sitation means that P = P T and hence P + = (P + ) T so that 
index(P + ) must be zero - and we don't need help from Theorem 3.2. But here is a non- 
symmetric version of essentially the same example: □ 



Example 3.4 (previous example, shifted) 7r'(z) := ir(i) + 1. 

tf=t 



z 



f 



f 



r 



t 



Without symmetry it is not so easy to see that splitting is impossible. In fact, this graph is 
disconnected (it has three components) but it cannot be split. □ 

There is a simple trick to make splitting immediately visible in every permutation's graph. 
We will demonstrate this trick for a simpler example 3.5. 

Example 3.5 Let ir(i) := i + 1. The corresponding matrix is our shift, S. 



Z 



The graph is clearly connected but we have a split at every single position i*, with j* = i* + 1 
in (10). This is more easily seen by depicting the i's and the j's separately, where we draw an 
arrow from i to j if j = ir(i)\ 



I : 



1 2 3 




□ 



In general, splitting (10) is equivalent to a separation like this: 




The nodes {i < i*} are only connected to {j < j*} and the nodes {i > i*} only to {j > j*}. By 
shifting the i-axis accordingly, one can straighten the cut to perfectly vertical. This is exactly 
what index cancellation does. Shifting the i-axis corresponds to renumbering the rows of our 
matrix. The plus index j* — i* is obvious if one thinks in terms of these i-j-graphs rather than 
the matrix. The conclusion that there is no split in Example 3.4 is now a simple exercise. 
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How do we compute the plus-index when the graph does not split? Here is our agenda: 

• By changing finitely many arrows, we construct a new permutation n' that does split. 

• This is possible with the split at any given position j*. 

• Changing finitely many arrows only changes finitely many matrix entries. Therefore n 
and 7r' have the same plus- index, and we know that the index for n' is j* — i*. 

Step 1: Delete arrows. Choose an arbitrary position j* G Z where the split should cross 
the j-axis. First delete the 2w arrows i t— > with i = j* — w, j* + w — 1 from the graph 
of 7r. Then the remaining diagram will have a big gap: 



j*-W-l j*+W 




The endpoints of the 2w deleted arrows are now empty. Put those endpoints (the column 
numbers) into ascending order. Suppose n of those column numbers are greater than or equal 
to j* - their endpoints are to the right of the split. The other 2w — n endpoints are to the left. 

Step 2: Rewire. Insert 2w new arrows connecting the empty starting points i = 
j* — w, + w — 1 to the (ordered!) empty endpoints. 




The new permutation 7r' splits at i* — j* + w — n. If P' is the matrix for tt', then P' + — P + is 
of finite rank. Therefore P + and P\ have the same index. This index comes directly from the 
splitting point of tv': 

k = index + (P) = index(P + ) = index(P^_) = j* — i* = n — w. (12) 

Remark 3.6 We are just reordering the l's in 2w consecutive rows of P. In the new order, 
those l's go from left to right. The figure has w = 3 and there are n = 4 ones on the right. 
Then in this example the index is k = A — 3=1. □ 
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The number of l's in columns j > j* is n. Thus we are counting the l's in the following 
sub matrix R, and subtracting the bandwidth w to obtain the index K — n — w: 



R= : ■-. : (13) 

y Pj*+w—ij* * ' ' Pj*+w-i,j*+2vj-i ) 

This submatrix R is lower triangular because P has bandwidth w. This completes our first 
proof of Theorem 1. 

To some extent this result is surprising. The Fredholm index is robust under perturbations 
in finitely many matrix entries - so it must be encoded somewhere deep down at infinity. (This 
is exactly where the limit operator approach [18, 13, 6], that we mentioned before, enters the 
stage.) But for banded permutation matrices, the plus- index can be computed from a finite 
submatrix - independent of the position. This is clearly a consequence of the 'stiff' rules 
(exactly one 1 in each row and column - and never leave the band!) that these matrices have 
to follow. 

Second proof of Theorem 1. The first proof identified the "right half" of any 2w consec- 
utive rows of P, as sufficient to determine the index of P + . Now we can give a direct proof by 
counting the l's in that right half R. In the figure below, the infinite submatrix P + is marked 
out by double lines, starting on the zeroth diagonal (marked with x's). We divide R into a 
part R\ outside P + and a part R2 inside P + . Now count the l's in each part : 



w rows 


• 

• • • 


w rows 


x • • • 

• x • • • R 2 

• • x • • • 







Each 1 in Ri will mean a zero column below it in P + . The other columns of P + are complete: 

dimension of nullspace of P + = number of l's in R\. 

Any zero rows in P + will be zero rows in R 2 . Those are counted by the l's that are missing 
from the w rows of R 2 : 

codimension of range of P + = w — (number of l's in R 2 ). 

Subtraction gives the index formula in Theorem 1: 

index of P + = (number of l's in R) — w = n — w. ■ (14) 



4 Factorizations of banded permutations 

This short section establishes the factorizations P c = BC and P c = F\F 2 ■ ■ ■ Fn of centered 
permutation matrices P c (plus-index equal to 0). The block- diagonal permutations B, C have 
block size 2w and iq, • • • , Fn have block size 1 or 2. 
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Both factorizations are particularly simple cases of a more general theorem [22, 23] for 
banded matrices with banded inverses. For permutations, P and P~ x = P T have the same 
bandwidth w. Figure 4.1 below shows a typical matrix P c with w — 2. Each non-empty square 
of size 2w = 4 contains two l's (by Theorem 1), because P c is centered (k = 0). 

Step 1. There is a 1 in each row and column. Row exchanges from the block B± 1 will 
move the l's in the first square into the top two rows (and thus l's go into the last two rows of 
the second square). The block B^ 1 acts in the same way on the next four rows of P c . All the 
circled entries ® have become zero. 

Step 2. Cf 1 executes column exchanges to move the 2 + 2 = 4 ones in its columns to the 
main diagonal (marked by x). When all blocks B^ 1 and Cf 1 act in this way, we reach the 
identity B~ 1 P C C~ 1 = I and hence P c = BC. This is Theorem 2. 





Co 1 


cr 1 






multiply by B 1 1 


• • x • 

• • X 

® ® 
® 


® 

® ® 

X • • 

• X • • 






multiply by B^ 1 




• • X • 
• • X 

® ® 
® 


® 

® ® 

X • • 
• X • • 















Figure 4.1: Each square has rank 2. B 1 1 and B 2 1 produce zeros in the circled positions by row exchanges. 
Then Cj -1 produces (by column exchanges) the 4 by 4 identity matrix with ones in the diagonal positions x. 

A similar factorization was established in [21] for any banded matrix with banded inverse. 
The same result was found for unitary banded matrices in [17], with a different proof. It appears 
that 2w is the right block size for these block-diagonal factorizations A = BC. 

Theorem 3 says that it is possible to factor P c into even simpler block-diagonal permutations 
F X F 2 ■ • • Fn- Now the blocks of each factor are 2 x 2 or 1 x 1. Thus each F{ exchanges a set 
of disjoint pairs of neighbors (as in the bubblesort algorithm, but with disjoint exchanges in 
parallel for much greater efficiency). 

A beautiful proof that N < 2w was given by Greta Panova [16, 1, 23]. We won't repeat the 
details. Her key idea is a variation on the "wiring diagram" of a permutation. 

Briefly, that diagram connects each point (0,z) for i G Z by a straight line to the point 
(1, 7r c (?')). The intersections of those lines tell us the order in which to exchange neighbors. Key 
point: All intersections lie on iV < 2w vertical lines in the [16, 23]. Exchanges on each vertical 
line can be executed in parallel by a matrix Fi (its bandwidth is 1). Then P c is the product 
F 1 F 2 ■ ■ ■ F N with N < 2w. 
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Part II 

Band-dominated matrices 



5 The plus-index of a band-dominated matrix 

A is a band matrix if it is supported on finitely many diagonals only, and it is a band-dominated 
matrix if it is the limit (in the £ 2 — > £ 2 operator norm) of a sequence of band matrices. 

A k is the semiinfinite submatrix (8) consisting of rows i > k and columns j > k. Then 
Lemma 3.1 generalizes from permutation matrices to band-dominated matrices that are Fred- 
holm: 

Lemma 5.1 If A is band-dominated and Fredholm then the index of Aj, is independent of 

kez. 

Proof. If A is a band matrix then the proof is literally that of Lemma 3.1 above. If A is 
band-dominated (i.e. in the closure of the set of band matrices) then the identification (9) still 
holds modulo compact operators, which is enough to prove the claim. ■ 

In analogy to A + = A\ we introduce the semiinfinite matrix v4_ that ends at a§$. 



modulo compact operators since A_ + and A + _ are compact. (Those off-diagonal blocks have 
finite rank if A is a band matrix and they are norm limits of finite rank matrices, hence compact, 
if A is band-dominated.) From this identification (15) we immediately have Lemma 5.2: 

Lemma 5.2 [17] A band- dominated biinfinite matrix A is Fredholm iff both A + and A^ are 
Fredholm. In that case 

index(A) = index(A + ) + index(v4_). 

So if A is invertible then index(74 + ) = — index(A_). Clearly, all results that we prove for the 
plus- index have their counterpart for the minus-index, index_(A) = index(A_). Here is another 
important result on the plus-index: 

Lemma 5.3 If A and B are band- dominated and Fredholm biinfinite matrices then 



( '■■ 

A- ■= (a-ij)ij=-oo = • ■ ■ a-i,-i 

\ ■ • • o _i 

Then A_ and A + will together determine the index of A: 



a-i.o 
ao,o / 




(15) 



index + (AS) 



index + (yl) + index + (i?). 
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Proof. By (15) we have, modulo compact operators 
AB = 



A. 











B_ 








B + 



A_B_ 








A + B + 



so that (AB) + = A + B + , whence index((AB)+) = index (A + 5+) = index(A + ) + index(E + ). ■ 
Recalling the forward shift S with index + (S') = — 1 = — 1, Lemma 5.3 yields 

index+^A) = k ■ index+(S') + index + (A) = -k + index+(A). (16) 
Then if k = index+(A), index cancellation A c := S K A indeed leads to plus-index zero. 



6 Triangular matrices 

The plus-index of a general band-dominated matrix A is not easy to compute. We will assume 
that A is banded and invertible (as a bounded operator on £ 2 (Z)). One possible approach (far 
from complete in this paper) is to factor A into a lower triangular L and an upper triangular 
U, with a banded permutation P in between: 

A = LPU = (lower triangular) (permutation) (upper triangular). 

This extends the factorization that comes from Gaussian elimination on a finite invertible 
matrix. Notice the "Bruhat convention" that places P between L and U. In that position P 
is unique. When P is a finite matrix, the l's are all determined by the ranks of the upper 
left submatrices of A. Elimination proceeds as normal (subtracting multiples of the pivot rows 
from lower rows) except that we wait to the end to reorder the pivot rows into U by using P. 

The steps are described in [24], where the main purpose is to extend A = LPU to biinfinite 
matrices A (banded and invertible). The factors L,P,U have bandwidth < 2w. But the 
elimination process has to be reconsidered, since it can no longer start at an (which is not the 
first entry, it is in the center of the matrix). Briefly, we look at the upper left submatrices 
Ak- (singly infinite) containing columns < k and rows < k + w. By Fredholm theory, the 
index of Ak- is independent of k. Its columns are independent because A is invertible (and all 
nonzeros survive into Ak-)- Any dependent rows of Ak- are among the last 2w rows (for the 
same reason). So the minus- index of A and the location of its main diagonal are determined 
by the number d of dependent rows. 

Since d does not depend on k, one row changes to independent and a new dependent row 
appears (if d > 0) when k increases to k+1. That newly independent row is the pivot row in the 
following elimination step. Dependence involves all the earlier rows of A^-, so this elimination 
process is not constructive — at least not in the usual sense. 

Note: We say that an infinite set of vectors {vi} i£ i with I C Z is linearly dependent if there 
is a non-zero sequence (cj) ig n G £ 2 (J) such that c%V{ = 0. In that sense, the columns of an 
infinite matrix form a linearly independent set iff that matrix is injective as an operator on £ 2 . 

Returning for a moment to permutation matrices, rows of are dependent when they are 
zero. They are independent when they contain a 1. Therefore the number d of zero rows (in 
the left half of any 2w rows of P) involves the same count of l's as in Theorem 1 (in the right 
half of those rows). 
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Now suppose that A is invertible and banded, with A = LPU, where all three factors have 
bounded inverses. (This can fail even for block-diagonal matrices A with 2x2 orthogonal 
blocks. The upper left entries in those blocks can approach zero.) As usual, L and L~ x are 
lower triangular, P is a permutation, and U and U~ l are upper triangular. Then P contains 
all information about indices and the correct position of the main diagonal. 



Lemma 6.1 The plus-index of A equals the plus-index of P (and that is easily computable from 
P). Also the minus-indices are equal. 















) o- 





Proof. As shown in [24], L and U are both banded, with bandwidth < 2w. By Corollary 6.4 
below, index + (L) and index + ({7) are zero. It follows from Lemma 5.2 that also index„(L) and 
index_(i/) are zero. Now use the key property that L and U are triangular: 

A_ A_ + \ _ ( L- 
A+„ A + J ~ V L+- L + 

The upper left block A_ immediately factors into 

A_ = L_P_U„. (17) 

By the formula for the index of a product we get 

index(A_) = index(L_) + index(P_) + index (i7_) = index(P_) (18) 

and then index + ( J 4) = — index_(v4) = — index_(P) = index + (P), again by Lemma 5.2 since A 
and P have index zero (they are invertible). ■ 

Alternatively, one can apply Lemma 5.3 directly to A = LPU to get 

index + (A) = index + (LPU) = index + (L) + index + (P) + index + ([7) = index + (P) 

and then conclude index_(A) = index_(P) from Lemma 5.2. 

We continue with some results on lower triangular biinfinite matrices and their plus-index 
(which locates their main diagonal). It is not surprising, but however in need of a proof, that 
the main diagonal of a lower triangular matrix has to be in the lower triangle. This is what we 
show now. So let A be a biinfinite matrix. 



Lemma 6.2 If A is lower triangular and Fredholm (but not necessarily band- dominated) then, 
for sufficiently large k, the submatrix A k from (8) is injective (as an operator on £ 2 (N)). 



First proof. If A is Fredholm and a = dim(N(A)) then there is a set J = {ji,---,j a } of 
integers such that the columns of A whose number is not in J form a linearly independent set. 
Let k be bigger than max(J). Then all columns j > k of A and hence (because A is lower 
triangular) all columns of the semiinfinite submatrix A^ are linearly independent. So Aj, is 
injective. ■ 
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Second proof. Since A is Fredholm on £ 2 , there are operators B and K such that BA = I+K 
with B bounded and K compact on I 2 [7, 10]. For k G Z, let I k denote the operator I 2 — > £ 2 
that puts all entries x\ of x G £ 2 with z < to zero and leaves all x\ with i > k unchanged. (I k 
is (8) for A = I.) Since A is lower triangular, we have IkAIk = Alk for all k G Z. Moreover, 
||/fcx||^2 — t- as A; — > +oo for all x G £ 2 . By compactness of K, it follows [10] that the operator 
norm goes to zero as k — > +oo. So fix k G Z large enough that ||ifci^|| < 1. Then 

C = I + Ifcif is invertible (by Neumann series). Now 

I k BI k AI k = I k BAI k = I k (I + K)I k = h + I k KI k = (I + I k K)I k = CI k , 

so that C~ l I k BI k AI k = I k . Now take x from the range of I k such that I k AI k x = 0. Then 
= C~ 1 I k BI k AI k x = I k x = x, so that x = is the only x in the range of I k with I k AI k x = 0. 
By I k AI k \n(i k ) = A k this means that the only solution y = {y.i)°l k of A k y = is the trivial 
solution y = 0. ■ 

Corollary 6.3 If A is band-dominated, lower triangular and Fredholm then index + (v4) < 0. 

Proof. By Lemma 6.2, the operator behind A k has nullspace {0} for all sufficiently large k G Z, 
so that index(Afc) = — j3 < 0. The claim now follows from Lemma 5.1. ■ 

So indeed, the main diagonal of a lower triangular matrix is in the lower triangle. By simple 
translation via S d , the main diagonal of a matrix that is zero above the d-th diagonal must 
be on or below that d-th diagonal. By passing to the adjoint matrix, one can write down an 
analogous statement for upper triangular matrices and their translates and then combine the 
two: The main diagonal of a band matrix must be in that band. 

Consequently, tridiagonal matrices have plus-index n = —1, or 1. Using this fact and 
our Theorem 1, it is easy to see that the only tridiagonal permutation matrices that are not 
centered (i.e. they have a nonzero plus-index) are the shifts S and Indeed, suppose P is a 
biinfinite permutation matrix with bandwidth w — 1 and plus-index k — — 1. Then, for every 
j* G Z, the number of ones in the 2x2 matrix (13) is n = k + w = 0. So the 1 in row j* must 
be in column j* — 1, whence P must be S. Similarly, w — 1 and k = 1 leads to P — S" 1 . 

Here is another consequence of Corollary 6.3: 

Corollary 6.4 If A is band- dominated, lower triangular and invertible with a lower triangular 
inverse then index + (A) = 0. 

Proof. By Corollary 6.3, we have k = index + (A) < and A = index + (A~ 1 ) < 0. But then 
k = —A by Lemma 5.3, so that k must be zero. ■ 

The latter shows how Problem 1 from the beginning of our paper is related to the plus-index. 
After discussing Problems 1 and 2, we mentioned that they are related: If the finite section 
method applies to A then a lower triangular A will have a lower triangular inverse. Now we 
discuss an amazing extension of this statement, coming from the following theorem (for finite 
matrices) by Asplund [2] : 
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Theorem 6.5 Let A be an invertible matrix and fix two integers p and k. Then the following 
are equivalent: 

(i) All submatrices B above the p-th superdiagonal of A have rank(I?) < k. 
(ii) All submatrices C above the p-th subdiagonal of A' 1 have rank(C) < p + k. 

See [25] for discussion and a new proof. With p = and k = 1 we get the familiar statement 
that lower triangular matrices have lower triangular inverses. With p = 1 and k = 1, one can see 
that all submatrices above the first subdiagonal (and similarly: below the first superdiagonal) 
of A -1 have rank < 2 if A is tridiagonal. 

Let us attempt to transfer Asplund's theorem to singly and doubly infinite matrices. Our 
assumption will be that the finite section method (short: FSM) applies to the infinite matrix 
A. Here is again what that means: 

The finite sections of A are square submatrices A n whose upper left and lower right corners 
lie at positions l n and r n on the zeroth diagonal of A. The sequences l n and r n go to — oo and 
+oo respectively, except when A is only semi-infinite - then l n is fixed at 1 and r n goes to 
+oo. One says that the FSM applies to A if: A is invertible, the matrices A n are invertible for 
sufficiently large n, and their inverses converge strongly to A~ l . This implies that A is centered 
(see [15, 20]). Also note that strong convergence (A~ x x — > A~ x x for all x) implies entrywise 
convergence of the matrices A' 1 to A -1 . See e.g. [13, 15, 18, 19, 20] and the references therein 
for more on the FSM. 

Under this (reasonable) assumption, we now extend Asplund's Theorem 6.5 to a singly or 
doubly infinite matrix A (not necessarily band-dominated): 

Proof. Suppose (i) holds, i.e. all (finite and infinite) submatrices B above the p-th superdiag- 
onal of A have rank(B) < k. To show that (ii) holds, let C be an arbitrary (finite or infinite) 
submatrix above the p-th subdiagonal of A -1 . We show that rank(C) < p + k. 

If C is infinite then rank(C) is the supremum of all rank(C') with C going through all 
finite submatrices of C. (In particular, rank(C) = oo iff the set of all rank(C') is unbounded.) 
Therefore it is enough to show that rank(C') < p + k for all finite submatrices of A~ l above 
the p-th subdiagonal. So we can assume that C is a finite matrix. 

Let li, l 2 , ... and r 1; r 2 , ... be the cut-off positions for the (by our assumption applicable) finite 
sections of A. For all sufficiently large n (say n > N) the interval [Z n ,r n ] contains the row and 
column numbers in which C is positioned at A~ x . By applicability of the FSM, the inverses 
of our finite sections A n of A converge entrywise to A" 1 . Let C n denote the submatrix of A^ 1 
that is at the same position as C is in A' 1 , so that C n -> C entrywise as n — > oo. 

Now we apply Theorem 6.5 to the finite matrix A n : All submatrices above the p-th su- 
perdiagonal of A n have rank < k, by (i), because they are submatrices of A (above the p-th 
superdiagonal). So C n , being a submatrix above the p-th subdiagonal of A' 1 , has rank < p + k. 
And this is true for all n > N. For finite matrices, entrywise convergence is convergence in all 
matrix norms, so that \\C n — C\\ — > 0. Rank is a lower semi-continuous function with respect 
to the matrix norm, so 

rank(C) < liminf rank(C„) < p + k 

71— >OQ 

and we are done with (i) (ii). The other direction is checked analogously. ■ 
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Remark 6.6 Our proof shows that applicability of the FSM is sufficient for Asplund's theorem 
to hold for an infinite matrix. To see that it is not necessary, go back to the permutation 
matrix P in Example 3.3. Asplund's theorem can be seen to hold for this matrix (note that 
P- 1 = P T = P) but the FSM does not apply: No one of the finite sections is invertible because 
there are no places l n and r n to cut without breaking one of the links in this permutation graph. 
□ 
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